The QMUL Team with Probabilistic SQL at Enterprise Track

نویسندگان

  • Thomas Roelleke
  • Elham Ashoori
  • Hengzhi Wu
  • Zhen Cai
چکیده

The enterprise track caught our attention, since the task is similar to a project we carried our for the BBC. Our motivation for participation has been twofold: On one hand, there is the usual challenge to design and test the quality of retrieval strategies. On the other hand, and for us very important, the TREC participation has been an opportunity to investigate the resource effort it requires to deliver a TREC result. Our main findings from this TREC participation are: 1. Through the consequent usage of our probabilistic variant of SQL, we could describe retrieval strategies within a few lines of code. 2. The processing time proved sufficient to deal with the collection. 3. The abstraction-oriented data modelling layers of our HySpirit framework enable relatively junior researches to explore a TREC collection and submit runs. 4. For the less complex retrieval tasks (discussion search, known-item search), minimal resources lead to acceptable results, whereas for the more complex retrieval tasks (expert search), inclusion and combination of all available evidence appear to significantly improve retrieval quality.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Critical Success Factors for Business Intelligence Implementation in an Enterprise Resource Planning System Environment Using DEMATEL: A Case Study at a Cement Manufacture Company in Indonesia

This paper is aimed at evaluating critical success factors in Business Intelligence (BI) implementation in an Enterprise Resource Planning (ERP) environment. The data analysis method used in this paper is the Decision Making Trial and Evaluation Laboratory Model (DEMATEL). The study has been conducted on a cement manufacturing strategic holding company that has implemented ERP since 2010. This ...

متن کامل

The Lowlands' TREC Experiments 2005

This paper describes our participation to the TREC HARD track (High Accuracy Retrieval of Documents) and the TREC Enterprise track. The main goal of our HARD participation is the development and evaluation of so-called query profiles: Short summaries of the retrieved results that enable the user to perform more focused search, for instance by zooming in on a particular time period. The main goa...

متن کامل

Probabilistic Databases: Where and How

Modern enterprise applications are forced to deal with unreliable, inconsistent and imprecise information in applications like search or business-intelligence. We propose here to use a probabilistic database to model such imprecisions and support complex, top-k SQL queries with ranked answers. We model all types of imprecisions as probabilistic data and evaluate SQL using a probabilistic semant...

متن کامل

Research on Enterprise Track of TREC 2007

We (ICT-CAS team) participated in the Enterprise Track of TREC 2007. This paper reports our experimental results on this track.

متن کامل

Using WSRM to Track SQL Server’s Resource Usage

Microsoft’s Windows System Resource Manager (WSRM) is a workload management tool included with Windows Server 2003 Enterprise or Datacenter. Administrators can use WSRM to control how CPU and memory are shared among competing processes. WSRM is typically used to normalize consolidated workloads (reducing the risk that a misbehaving application will interfere with others on the system), or to en...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005